ETSITS126 104V5.2.0 



(2003-06) 



Technical Specification 



Digital cellular telecommunications system (Phase 2+); 

Universal Mobile Telecommunications System (UMTS); 

ANSI-C code for the floating-point 
Adaptive Multi-Rate (AMR) speech codec 
(3GPP TS 26.104 version 5.2.0 Release 5) 



33i^ 



GS 




® 



GLOBAL SYSTEM FOR 
MOBILE COMMUNICATIONS 





u 



3GPP TS 26.1 04 version 5.2.0 Release 5 1 ETSI TS 1 26 1 04 V5.2.0 (2003-06) 



Reference 



RTS/TSGS-04261 04v520 
Keywords 



GSM, UMTS 



ETSI 

650 Route des Lucioles 
F-06921 Sophia Antipolis Cedex - FRANCE 

Tel. : +33 4 92 94 42 00 Fax: +33 4 93 65 47 1 6 

Siret N ° 348 623 562 0001 7 - NAF 742 C 
Association a but non lucratif enregistree a la 
Sous-Prefecture de Grasse (06) N° 7803/88 



Important notice 



Individual copies of the present document can be downloaded from: 
http://www.etsi.orq 

The present document may be made available in more than one electronic version or in print. In any case of existing or 

perceived difference in contents between such versions, the reference version is the Portable Document Format (PDF). 

In case of dispute, the reference shall be the printing on ETSI printers of the PDF version kept on a specific network drive 

within ETSI Secretariat. 

Users of the present document should be aware that the document may be subject to revision or change of status. 

Information on the current status of this and other ETSI documents is available at 

http://portal.etsi.orq/tb/status/status.asp 

If you find errors in the present document, send your comment to: 
editor(a)etsi.orq 

Copyright Notification 

No part may be reproduced except as authorized by written permission. 
The copyright and the foregoing restriction extend to reproduction in all media. 

© European Telecommunications Standards Institute 2003. 
All rights reserved. 

DECT'^", PLUGTESTS™ and UMTS™ are Trade Marks of ETSI registered for the benefit of its Members. 
TIPHON^" and the TIPHON logo are Trade Marks currently being registered by ETSI for the benefit of its Members. 
2QppTM |g g jracle Mark of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. 



ETSI 



3GPP TS 26.1 04 version 5.2.0 Release 5 2 ETSI TS 1 26 1 04 V5.2.0 (2003-06) 



Intellectual Property Rights 



IPRs essential or potentially essential to the present document may have been declared to ETSI. The information 
pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found 
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This Technical Specification (TS) has been produced by ETSI 3rd Generation Partnership Project (3GPP). 

The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or 
GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables. 
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Foreword 



rd , 



This Technical Specification (TS) has been produced by the 3 Generation Partnership Project (3GPP). 

The contents of the present document are subject to continuing work within the TSG and may change following formal 
TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an 
identifying change of release date and an increase in version number as follows: 

Version x.y.z 

where: 

X the first digit: 

1 presented to TSG for information; 

2 presented to TSG for approval; 

3 or greater indicates TSG approved document under change control. 

y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, 
updates, etc. 

z the third digit is incremented when editorial only changes have been incorporated in the document. 
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Scope 



This Technical Standard (TS) contains an electronic copy of the ANSI-C code for a floating-point implementation of 
the Adaptive Multi-Rate codec. This floating-point codec specification is mainly targeted to be used in multimedia 
applications such as the 3G-324M terminal specified in 3GPP TS 26. 1 10, or in packet-based (e.g., H.323) applications. 
The bit-exact fixed-point ANSI-C code in 3GPP TS 26.073 remains the preferred implementation for all applications, 
but the floating-point codec may be used instead of the fixed-point codec when the implementation platform is better 
suited for a floating-point implementation. It has been verified that the fixed-point and floating-point codecs 
interoperate with each other without any artifacts. 

The floating-point ANSI-C code in this specification is the only standard conforming non-bit-exact implementation of 
the Adaptive Multi Rate speech transcoder (3GPP TS 26.090 [2]), Voice Activity Detection (3GPP TS 26.094 [6]), 
comfort noise generation (3GPP TS 26.092 [4]), and source controlled rate operation (3GPP TS 26.093 [5]). The 
floating-point code also contains example solutions for substituting and muting of lost frames (3GPP TS 26.091 [3]). 

The fixed-point specification in 26.073 shall remain the only allowed implementation for the 3G mandatory 
speech service and the use of the floating-point codec is strictly limited to other services. 

The floating-point encoder in this specification is a non-bit-exact implementation of the fixed-point encoder producing 
quality indistinguishable from that of the the fixed-point encoder. The decoder in this specification is functionally a 
bit-exact implementation of the fixed-point decoder, but the code has been optimized for speed and the standard fixed- 
point libraries are not used as such. 



Normative references 



The following documents contain provisions which, through reference in this text, constitute provisions of the present 
document. 

• References are either specific (identified by date of publication, edition number, version number, etc.) or 
non-specific. 

• For a specific reference, subsequent revisions do not apply. 

• For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including 
a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same 
Release as the present document. 

[1] 3GPP TS 26.074: "AMR Speech Codec; Test sequences". 

[2] 3GPP TS 26.090: "AMR Speech Codec; Speech transcoding". 

[3] 3GPP TS 26.091: "AMR Speech Codec; Substitution and muting of lost frames". 

[4] 3GPP TS 26.092: "AMR Speech Codec; Comfort noise aspects". 

[5] 3GPP TS 26.093: "AMR Speech Codec; Source controlled rate operation". 

[6] 3GPP TS 26.094: "AMR Speech Codec; Voice Activity Detection". 

[7] 3GPP TS 26.073: "ANSI-C code for the Adaptive Multi Rate speech codec". 

[8] 3GPP TS 26.101: "AMR Speech Codec Frame Structure". 

[9] RFC 3267 "A Real-Time Transport Protocol (RTP) Payload Format and File Storage Format for 

Adaptive Multi-Rate (AMR) and Adaptive Multi-Rate Wideband (AMR-WB) Audio Codecs, June 
2002. 
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3 Definitions and abbreviations 

3.1 Definitions 

Definition of terms used in the present document, can be found in 3GPP TS 26.090 [2], 3GPP TS 26.091 [3], 3GPP 
TS 26.092 [4], 3GPP TS 26.093 [5], and 3GPP TS 26.094 [6]. 

3.2 Abbreviations 

For the purpose of the present document, the following abbreviations apply: 

ANSI American National Standards Institute 

ETS European Telecommunication Standard 

GSM Global System for Mobile communications 

I/O Input/Output 

RAM Random Access Memory 

ROM Read Only Memory 



C code structure 



This clause gives an overview of the structure of the floating-point C code and provides an overview of the contents and 
organization of the C code attached to this document. The basic structure of the floating-point C code follows that of the 
bit-exact fixed-point code [7] . 

The C code has been verified on the following systems: 

IBM PC/AT compatible computers with Windows NT40 and Microsoft Visual C++ v.5.0 compiler; 

HP workstations and GNU gcc compiler; 

IBM PC/AT compatible computers with Linux operating system and GNU gcc compiler; 
ANSTC 9899 was selected as the programming language because portability was desirable 

4.1 Contents of the C source code 

The C code distribution has all files in the root level. 

The files with suffix "c" contain the source code and the files with suffix "h" are the header files. The ROM data is 
contained in "rom" files with suffix "h". 

The C code does not contain any speech coder installation verification data files. Verification for the bit-exact decoder 
is defined in specification 3GPP TS 26.073 [7]. 

Makefiles are provided for the platforms in which the C code has been verified (listed above). Once the software is 
installed, this directory will have a compiled version of encoder and decoder and all the object files. 
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4.2 Program execution 

The Adaptive Multi-Rate codec is implemented in two programs: 

(encoder) speech encoder; 

(decoder) speech decoder. 

The programs should be called hke: 

encoder [-dtx] mode speech_file bitstream_file 

or 

encoder [-dtx] -modefile=mode_file speech_file bitstream_file 

decoder <parameter file> <speech output file> 

The speech files contain 16-bit linear encoded PCM speech samples and the parameter files contain encoded speech 
data and some additional flags. 

See the file readme.txt for more information on how to run the encoder and decoder programs. 



4.3 Coding style 



The C code has been written according to structuring conventions used in 3GPP TS 26.073 [7]. Encoder and decoder 
state structures are allocated and initialized with special initializing functions. There are no separate functions for each 
module, as opposed to the fixed-point implementation in 3GPP TS 26.073 [7]. 

4.4 Code hierarchy 

The code hierarchy follows the one specified in 3GPP TS 26.073 [7]. 

Figures 1 to 4 are call graphs that show the functions used in the speech codec, including the functions of VAD, DTX, 
and comfort noise generation. 

Each column represents a call level and each cell a function. The functions contain calls to the functions in rightwards 
neighboring cells. The time order in the call graphs is from the top downwards as the processing of a frame advances. 
All standard C functions, such as printf(), fwrite(), etc., have been omitted. 

The encoder call graph is broken down into three separate call graphs, shown in Tables 1 to 3. 
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Table 1 : Speech encoder call structure 



Speech_Encode__Frame 



Pre_Process 



tx dtx handler 



Ipc 



Isp 



checkjsp 



pre_big 



oljtp 



vad_pitch_detection 



subframePreProc 



cljtp 



cbsearch 



gainQuant 



update_gp_clipping 



subframePostProc 



Pred_lt_3or6 



Convolve 



filter_bank 



vad_decision 



fir5t_filter_stage 



filters 



filters 



levei_calculation 



complex_e5timate_adapt 



complex_vad 



noise_estimate_update 



hangover_addition 



update_cntrl 



Azjsp 



Q_plsf_5 



lnt_lpc_1 and3_2 



lnt_ipc_1 and3 



Q_plsf_3 



lnt_lpc_1to3_2 



lnt_lpc_1to3 



Dotproduct40 



Lspjsf 



Reorderjsf 



Lsfjsp 



Q_plsf_3 



Chebps 



Lspjsf 



Lsf_wt 



Vq_subvec 



Vq_subvec_ 



Reorderjsf 



Lsfjsp 



Lsp_az 



Lsp_az 



Lspjsf 



Lsf_wt 



Vq_subvec3 



Vq_subvec4 



Reorderjsf 



Lsfjsp 



Lsp_az 



Lsp_az 



GetJsp_pol 



GetJsp_pol 



GetJsp_pol 



GetJsp_pol 



Lspjsf 



Lsf_wt 



Vq_subvec3 



Vq_subvec4 



Reorderjsf 



Lsfjsp 



Weight_Ai 



Residu 



Synjilt 



Pitch_ol 



Pitcfi_ol_wgh 



vadJone_detection_update 



Lag_max 



comp_corr 



hp_max 



comp_corr 



Lag_max_wght 



gmed_n 



fip_max^ 



vadJone_detection 



vadJone_detection_update 



vadJone_detection 



Weigh t_Ai 



Synjilt 



Residu 



Pitch Jr 



G_pitch 



check_gp_clipping 



q_gain_pitch 



see Table 2 



see Table 3 



Copy 



Synjilt 



getRange 



Norm_Corr 



Encjag3 



Encj 



Dotproduct4Q 



lnterpol_3or6 



Dotproduct40 
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Table 2: cbsearch call structure 



cbsearch 


code_2i40_9bits 


cor h X 


Dotproduct40 | 


set sign 




cor h 


Dotproduct40 | 


search 2i40 9bits 




build code 2i40 9bits 


code_2i40_11bits 


cor h X 


Dotproduct40 | 


set sign 




cor h 


Dotproduct40 | 


search 2i40 11 bits 




build code 2i40 11 bits 


code_3i40_14bits 


cor h X 


Dotproduct40 | 


set sign 




cor h 


Dotproduct40 | 


search 3i40 




build code 3i40 14bits 


code_4i40_17bits 


cor h X 


Dotproduct40 | 


set sign 




cor h 


Dotproduct40 | 


search 4i40 




build code 4i40 




code_8i40_31bits 


cor h X 


Dotproduct40 


set sign12k2 


Dotproduct40 


cor h 


Dotproduct40 


search 8i40 




build code 8i40 31 bits 




compress code 


compresslO | 


code_10i40_35bits 


cor h X 


Dotproduct40 


set sign12k2 


Dotproduct40 


cor h 


Dotproduct40 


search 10i40 




build code 10140 35bits 




q_P 





Table 3: gainQuant call structure 



gainQuant 


gc pred 


Dotproduct40 






caic tilt energies 


Dotproduct40 




Dotproduct40 






MR475 update unq pred 




MR475 gain quant 


gc pred 


Dotproduct40 


q gain code 






MR795_gain_quant 


q gain pitch 




MR795 gain code quant3 




calc unfilt energies 


Dotproduct40 


gain adapt 


Gmed n f 


MR795 gain code quant mod 




Qua_gain 
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Table 4: Speech decoder call structure 



Speech_Decode_Frame 


Decoderamr 


rx dtx handler 








Decoder amr reset 






dtx_dec 


Copy 






Lsf Isp 




D pIsf 3 


Lsf Isp 




pseudonoise 






Lsp lsf 




Reorder lsf 




Lsp Az 


Get Isp pol 




A Refl 






Log2 


Log2 norm 




Pow2 






Build CN code 


pseudonoise 




Syn tilt 






Lsf Isp 






Isp avg 




Build CN param 




D pisf 3 


Lsf Isp 




Int Ipc 1to3 


Lsp Az 


Get Isp pol 




D_plsL5 


Reorder lsf 






Lsf Isp 




Int Ipc lands 


Lsp Az 


Get Isp pol 




Dec Iag3 








Pred It 3or6 40 




Dec lags 




decode 2140 9bits 




decode 2140 11 bits 




decode 3140 Ubits 




decode 4140 17bits 




decode 8140 31 bits 


decompress codewords 


decompresslO 




ec gain pitch 


gmed n 






d gain pitch 






ec gain pitch update 




decode 10i40 35bits 




Dec^gain 


Log2 


Log2 norm 




gc_pred 


Log2 


Log2 norm | 


Log2 norm 




Pow2 






gc pred update 




ec_gain_code 


gmed n 




gc pred average limited 




gc pred update 




ec gain code update 






d_gain_code 


gc_pred 


Log2 


Log2 norm | 


Log2 norm 




Pow2 






gc pred update 




Int lsf 






Cb gain average 




ph disp 




sqrt 1 exp 




Ex Ctrl 


gmed n 




agc2 


Inv sqrt 




Syn tilt 






Bgn scd 


gmed n 




dtx_dec_activity_update 


Copy 




Log2 


Log2 norm 




Isp avg 








Post_Filter 


Residu40 




Syn tilt 




age 


energy new 


energy old 




Inv sqrt 






Post Process 









4.5 



Variables, constants and tables 



The data types of variables and tables used in the floating-point implementation are signed integers in 2's complement 
representation, defined by: 

Words 8 bit variable 
UWordS 8 bit unsigned variable 

Wordl6 16 bit variable 
Word32 32 bit variable 

Floating-point numbers use the IEEE (Institute of Electrical and Electronics Engineers) format: 

Float32 8 bit exponent, 23 bit mantissa, 1 bit sign 

Float64 1 1 bit exponent, 52 bit mantissa, 1 bit sign 
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Furthermore some enum types are used, all possible to represent with one byte, and a boolean Flag. 

4.5.1 Description of constants used in tine C code 

Constants for the codec are defined in rom (h) files. 



4.5.2 Description of fixed tables used in tine C code 

This section contains a listing of all fixed tables sorted by source file name and table name. 

Table 5: Speech encoder fixed tables 



File 



Table name 



Type[Length] 



Description 



rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 

rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom_enc.h 
rom enc.h 



trackTable 

gammal 

gamma1_12k2 

gamma2 

b60 

startPosI 

startPos2 

startPos 

corrweight 

qua_gain_pitch 

qua_gain_pitch_l\/IR1 2 

2 

qua_gain_code 

gray 

grid 

b24 

lag_wind 

lsp_init_data 

past_rq_init 

mean_lsf_3 

mean_lsf_5 

predjac 

dico1_lsf_3 

dico2_lsf_3 

dico3Jsf_3 

mr515_3_lsf 

mr795_1Jsf 

dico1Jsf_5 

dico2Jsf_5 

dico3_lsf_5 

dico4_lsf_5 

dico5_lsf_5 

table_gain_MR475 

table_gain_highrates 

table_gainJowrates 

window_200_40 

window_160_80 

window_232_8 

corrweight 

mode_dep_parm 



Word8[4*5] track table for algebraic code book search (MR475, MR51 5) 

Float32[1 0] spectral expansion factors 

Float32[1 0] spectral expansion factors 

Float32[1 0] spectral expansion factors 

Float32]61 ] interpolation filter coefficients 

Word1 6[2] track start search position for first pulse 

Word1 6[4] track start search position for second pulse 

Word 1 6[1 6] track start search position 

Float32[251 ] weighting of the correlation function in open loop LTP search (MR1 02) 

Float32[1 6] adaptive codebook gain quantization table (MR795) 

Float32[1 6] adaptive codebook gain quantization table (MR1 22) 

Float32[64] fixed codebook gain quantization table (MR1 22, MR795) 

Word8[8] gray coding table 

Float32[61 ] grid points at wich Chebyshev polynomials are evaluated 

Float32[25] interpolation filter coefficients 

Float32[1 0] lag window table 

Float32[10] initialization table for Isp history in DTX 

Float32[80] initialization table for the MA predictor in DTX 

Float32[1 0] LSF means (not in MR1 22) 

Float32[1 0] LSF means (MR1 22) 

Float32[1 0] LSF prediction factors (not in MR1 22) 

Float32[3*256] 1" LSF quantizer (not in MR122 and MR795) 

Float32[3*512] 2"" LSF quantizer (not in MR122) 

Float32[4*512] 3'" LSF quantizer (not in l\/IR122, MR515 and MR475) 

Float32[4*128] 3'" LSF quantizer (MR515 and MR475) 

Float32[3*51 2] 1 '' LSF quantizer (MR795) 

Float32[4*1 28] 1 " LSF quantizer (MR1 22) 

Float32[4*256] 2"" LSF quantizer (MR122) 

Float32[4*256] 3'" LSF quantizer (MR122) 

Float32[4*256] 4"" LSF quantizer (MR122) 

Float32[4*64] 5" LSF quantizer (MR1 22) 

Float32[4*256] gain quantization table (MR475) 

Float32[128*3] gain quantization table (MR67, MR74 and MR102) 

Float32[64*3] gain quantization table (MR51 5 and MR59) 

Float32[240] LP analysis window (not in IVIRI 22) 

Float32[240] 1 " LP analysis window (MR1 22) 

Float32[240] 2"" LP analysis window (IVIRI 22) 

Float32[251] correlation weights 

Word8[8*9] parameters defining the adaptive codebook search per mode 
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Table 6: Speech decoder fixed tables 



File 



Table name 



Type[Length] 



Description 



rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom dec.h 



dtx_log_en_adjust 

cdown 

pdown 

pred 

pred_MR122 

gamma3_MR122 

gammas 

gamma4_MR122 

gamma4 

bitno_MR475 

bitno_MR515 

bitno_MR59 

bitno_MR67 

bitno_MR74 

bitno_MR795 

bitno_MR102 

bitno_MR122 

bitno_MRDTX 

qua_gain_pitch 

qua_gain_code 

gray 

dgray 

sqrMable 

inv_sqrt_table 

log2_table 

pow2_table 

cos_table 

acos_slope 

ph_imp_low_MR795 

ph_imp_mid_MR795 

phjmpjow 

ph_imp_mid 

past_rq_init 

mean_lsf_3 

mean_lsf_5 

predjac 

dico1_lsf_3 

dico2_lsf_3 

dico3_lsf_3 

mr515_3_isf 

mr795_1_lsf 

dico1_lsf_5 

dico2_lsf_5 

dico3_lsf_5 

dico4_lsf_5 

dico5_lsf_5 

table_gain_MR475 

table_gain_highrates 

table_gain_lowrates 

inter_6 

window_200_40 

table_speech_bad 

table_SID 

table_DTX 

table mute 



Word16[9] level adjustments for ech mode 
Word32[7] attenuation factors for codebook gain 
Word32[7] attenuation factors for adaptive codebook gain 
Word32[4] algebraic code book gain IVIA predictor coefficients 
Word32[4] algebraic code book gain MA predictor coefficients (MR122) 
Word32[1 0] spectral expansion factors 
Word32[1 0] spectral expansion factors 
Word32[1 0] spectral expansion factors 
Word32[1 0] spectral expansion factors 
Word16[17] number of bits per parameter to transmit (MR475) 
Word16[19] number of bits per parameter to transmit (MR515) 
Word16[19] number of bits per parameter to transmit (MR59) 
Word16[19] number of bits per parameter to transmit (MR67) 
Word16[19] number of bits per parameter to transmit (MR74) 
Word16[23] number of bits per parameter to transmit (MR795) 
Word16[39] number of bits per parameter to transmit (MR102) 
Word16[57] number of bits per parameter to transmit (MR122) 
Word1 6[5] number of bits per parameter to transmit (MRDTX) 
Word32[1 6] adaptive codebook gain quantization table (MR1 22, MR795) 
Word32[96] fixed codebook gain quantization table (MR1 22, MR795) 
Word8[8] gray coding table 
Word8[8] gray decoding table 
Word32[49] table to compute sqrt(x) 
Word32[49] table used in inverse square root computation 
Word32[33] table used inbase 2 logharithm computation 
Word32[33] table used in 2 to the power computation 
Word32[65] table to compute cos(x) in Lsf_lsp() 
Word32[64] table to compute acos(x) in Lsp_lsf() 
Word32[40] pfiase dispersion impulse response (MR795) 
Word32[40] pfiase dispersion impulse response (MR795) 
Word32[40] pfiase dispersion impulse response (1\/!R475 - MR67) 
Word32[40] pfiase dispersion impulse response (MR475 - MR67) 
Word32[80] initialization table for the MA predictor in DTX 
Word32[1 0] LSF means (not in MR1 22) 
Word32[1 0] LSF means (MR1 22) 
Word32[1 0] LSF prediction factors (not in MR1 22) 

Word32[3*256]] 1" LSF quantizer (not in MR122 and MR795) 

Word32[3*512] 2"'' LSF quantizer (not in MR122) 

Word32[4*512] 3'" LSF quantizer (not in MR122, MR515 and MR475) 

Word32[4*128] 3'" LSF quantizer (MR515 and MR475) 

Word32[3*51 2] 1 " LSF quantizer (MR795) 

Word32[4*1 28] 1 " LSF quantizer (MR1 22) 

Word32[4*256] 2"" LSF quantizer (MR122) 

Word32[4*256] 3'" LSF quantizer (MR122) 

Word32[4*256] 4"' LSF quantizer (MR122) 

Word32[4*64] S'" LSF quantizer (MR1 22) 

Word32[4*256] gain quantization table (MR475) 

Word32[128*4] gain quantization table (MR67, MR74 and MR102) 

Word32[64*4] gain quantization table (MR515 and MR59) 
Word32[61] interpolation filter coefficients 
Word32[240] LP analysis window (not in MR1 22) 
UWord8[9] comparison optimisation table in DTX 
Uword8[9] comparison optimisation table in DTX 
Uword8[9] comparison optimisation table in DTX 
Uword8[9] comparison optimisation table in DTX 



ETSI 



3GPP TS 26.104 version 5.2.0 Release 5 



13 



ETSI TS 126 104 V5.2.0 (2003-06) 



4.5.3 Static variables used in tine C code 

In this section, two tables that specify the static variables for the speech encoder and decoder, respectively, are shown. 
All static variables are declared within a C struct. 

Table 7: Speech encoder static variables 



struct name 


Variable 


Type[Lengtli] 


Description 


Speech_Encode_ 


cod_amr_state 


cod_amrState 


see below in this table 


FrameState 










pre state 


Pre ProcessState 


see below in this table 




dtx 


Word32 


Is set if DTX functionality is used 


Pre_ProcessState 


y2 


Float32 


filter state 




yi 


Word16Float32 


filter state 




xO 


Float32 


filter state 




x1 


Float32 


filter state 


cod_amrState 


old_speech 


Float32 [320] 


speech buffer 




speech 


Float32* 


pointer to current frame in old_speech 




p_window 


Float32* 


pointer to LPC analysis window in old_speech 




p_window_12k2 


Float32* 


pointer to LPC analysis window with no lookahead in 
old_speech(MR122) 




new_speech 


Float32* 


pointer to the last 160 speech samples in old_speech 




old_wsp 


Float32 [303] 


buffer holding spectral weighted speech 




wsp 


Float32* 


pointer to the current frame in old_wsp 




oldjags 


Word32[5] 


open loop LTP states 




oLgainJIg 


Float32 [2] 


enables open loop pitch lag weighting (MR102) 




old_exc 


Float32[314] 


excitation vector 




exc 


Float32* 


current excitation 




ai_zero 


Float32[51] 


history of weighted synth. filter followed by zero vector 




zero 


Float32* 


zero vector 




h1 


Float32* 


impulse response of weighted synthesis filter 




hvec 


Float32 [80] 


zero vector followed by impulse response 




IpcSt 


IpcState 


see below in this table 




IspSt 


IspState 


see below in this table 




cILtpSt 


cILtpState 


see below in this table 




gainQuantSt 


gainQuantState 


see below in this table 




pitchOLWghtSt 


pitchOLWghtState 


see below in this table 




tonStabSt 


tonStabState 


see below in this table 




vadSt 


vadState 


see below in this table 




vadSt2 


vadState2 


see below in this table 




dtx 


Word32 


is set if DTX functionality is used 




dtx_encSt 


dtx encState 


see below in this table 




mem_syn 


Float32[10] 


synthesis filter memory 




mem_wO 


Float32[10] 


weighting filter memory (applied to error signal) 




mem_w 


Float32[10] 


weighting filter memory (applied to input signal) 




mem_err 


Float32 [50] 


filter memory for production of error vector 




error 


Float32* 


error signal (input minus synthesized speech) 




sharp 


Float32 


pitch sharpening gain 


vadState 


bckr_est 


Float32 [9] 


background noise estimate 




avejevel 


Float32 [9] 


averaged input components for stationary estimation 




oldjevel 


Float32 [9] 


input levels of the previous frame 




subjevel 


Float32 [9] 


input levels calculated at the end of a frame (lookahead) 




a_data5 


Float32 [6] 


memory for the filter bank 




a_data3 


Float32 [5] 


memory for the filter bank 




burst_count 


Word 16 


counts length of a speech burst 




hang_count 


Word 16 


hangover counter 




stat_count 


Word 16 


stationary counter 




vadreg 


Word32 


1 5 flags for intermediate VAD decisions 




pitch 


Word32 


1 5 flags for pitch detection 




tone 


Word 16 


15 flags for tone detection 




connplex_high 


Word 16 


flags for complex detection 




complexjow 


Word 16 


flags for complex detection 




oldlag_count 


Word32 


variables for pitch detection 




oldlag 


Word32 


variables for pitch detection 




connplex_hang_count 


Word 16 


complex hangover counter, used by VAD 




complex_hang_timer 


Word 16 


hangover initiator, used by CAD 
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Struct name 


Variable 


Type[Lengtli] 


Description 




best_corr_hp 

speech_vad_decision 

complex_warning 

sp_burst_count 

corr_hp_fast 


Float32 
Word 16 
Word 16 
Word 16 
Word 16 


filtered value 

final decision 

complex background warning 

counts length of a speech burst inci HO addition 

filtered value 


dtx_encState 


lsp_hist 
log_en_hist 
hist_ptr 
log_en_index 

init_lsf_vq_index 
Ispjndex 

dtxHangoverCount 
decAnaElapsedCount 


Float32[80] 
Float32 [8] 
Word 16 
Word 16 

Word32 
Word16[3] 
Word 16 
Word 16 


LSP history (8 frames) 
logarithmic frame energy history (8 frames) 
pointer to the cyclic history vectors 
Index for logarithmic energy 

initial index for Isf predictor 
Isp indecies to the three code books 
is decreased in DTX hangover period 
counter for elapsed speech frames in DTX 


IpcState 


LevinsonSt 


LevinsonState 


see below 


LevinsonState 


old A 


Float32[11] 


last frames direct form coefficients 


IspState 


lsp_old 

lsp_old_q 

qSt 


Float32[10] 
Float32[10] 
Q pIsfState 


old LSP vector 

old quantized LSP vector 

see below in this table 


Q pIsfState 


past rq 


Float32[10] 


past quantized LSF prediction error 


cILtpState 


pitchSt 


Pitch frState 


see below in this table 


tonStabState 


count 

gp 


Word 16 
Float32[7] 


count consecutive (potential) resonance frames 
pitch gain history 


Pitch frState 


TO prev subframe 


Word32 


integer, pitch lag of previous subframe 


gainQuantState 


sfO_ gcodeO 

sfO_ target_en 

sfO_ coeff 

gain_idx_ptr 
gc_predSt 
gc predUncSt 
adaptSt 


Float32 

Float32 

Float32 [5] 

Word 16* 
gc_predState 
gc_predState 
GainAdaptState 


subframe 0/2 codebook gain 

subframe 0/2 target energy 

subframe 0/2 energy coefficient 

pointer to gain index value in parameter frame 
see below in this table 
see below in this table 
see below in this table 


gc_predState 


past_qua_en 


Float32[4] 


IVIA predictor memory (20*log10(pred. error)) 


GainAdaptState 


onset 

prev_alpha 
prev_gc 
Itpg mem 


Word 16 
Float32 
Float32 
Float32 [5] 


onset counter 
previous adaptor output 
previous codebook gain 
pitch gain history 


pitchOLWghtState 


old_TO_med 

ada_w 

wghtjig 


Word32 
Float32 
Word 16 


weighted open loop pitch lag 

weigthing level depeding on open loop pitch gain 

switches lag weighting on and off 
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Table 8: Speech decoder static variables 



struct name 


Variable 


Type[Length] 


Description 


Speech_Decode_FrameSt 
ate 


decoder_amrState 


Decoder_amrState 


see below in tills table 


post_state 


Post_FilterState 


see below in this table 




postHP_state 


Post ProcessState 


see below in this table 


Decoder_amrState 


old_exc 


Word32[194] 


excitation vector 




exc 


Word32* 


current excitation 




lsp_old 


Word32[10] 


LSP vector of previous frame 




mem_syn 


Word32[10] 


synthesis filter memory 




sharp 


Word32 


pitch sharpening gain 




old_TO 


Word32 


pitch sharpening lag 




prev_bf 


Word 16 


previous value of "bad frame" flag 




prev_pdf 


Word 16 


previous value of "pot. dangerous frame" flag 




state 


Word 16 


ECU state (0..6) 




excEnergyHist 


Word32[9] 


excitation energy history 




TOJagBuff 


Word32 


received pitch lag for ECU 




inBackgroundNoise 


Word32 


background noise flag 




voicedHangover 


Word32 


hangover flag 




ItpGainHistory 


Word32[9] 


pitch gain history 




background_state 


Bgn_scdState 


see below in this table 




Cb_gain_averState 


Gb_gain_averageState 


see below in this table 




Isp avg St 


lsp_avgState 


see below in this table 




IsfState 


D_plsfState 


see below in this table 




ec_gain_p_st 


ec_gain_pitchState 


see below in this table 




ec_gain_c_st 


ec_gain_codeState 


see below in this table 




pred_state 


gc predState 


see table 7 




nodataSeed 


Word 16 


seed for ON generator 




ph_disp_st 


ph_dispState 


see below in this table 




dtxDecoderState 


dtx decState 


see below in this table 


dtx_decState 


since_last_sid 


Word 16 


number of frames since last SID frame 




true_sid_period_inv 


Word 16 


inverse of true SID update rate 




log_en 


Word32 


logarithmic frame energy 




old_log_en 


Word32 


previous value of log_en 




pn_seed_rx 


Word32 


random number generator seed 




Isp 


Word32[10] 


LSP vector 




lsp_old 


Word32[10] 


previous LSP vector 




lsf_hist 


Word32[80] 


LSF vector history (8 frames) 




lsf_hist_ptr 


Word 16 


index to beginning of LSF history 




lsf_hist_mean 


Word32[80] 


mean-removed LSF history (8 frames) 




log_pg_nnean 


Word 16 


mean-removed logarithmic prediction gain 




log_en_hist 


Word32[8] 


logarithmic frame energy history 




log_en_hist_ptr 


Word 16 


index to beginning of log, frame energy history 




log_en_adjust 


Word 16 


mode-dependent frame energy adjustment 




dtxHangoverCount 


Word 16 


counts down in hangover period 




decAnaElapsedCount 


Word 16 


counts elapsed speech frames after DTX 




sidjrame 


Word 16 


flags SID frames 




valid_data 


Word 16 


flags SID frames containing valid data 




dtxHangoverAdded 


Word 16 


flags hangover period at end of speech 




dtxGlobalState 


enum DTXStateType 


DTX state flags 




data updated 


Word 16 


flags CNI updates 


Bgn_scdState 


frameEnergyHist 


Word32[60] 


history of synthesis frame energy 




bgHangover 


Word 16 


number of frames since last speech frame 


Cb_gain_averageState 


cbGainHistory 


Word32[7] 


codebook gain history 




hangVar 


Word 16 


counts length of talkspurt in subframes 




hangCount 


Word 16 


number of subframes since last talkspurt 


Isp avgState 


lsp_meanSave 


Word32[10] 


averaged LSP vector 


D_plsfState 


past_r_q 


Word32[10] 


past quantized LSF prediction vector 




past_lsf_q 


Word32[10] 


past dequantized LSF vector 


ec_gain_pitchState 


pbuf 


Word32[5] 


pitch gain history 




past_gain_pit 


Word32 


previous pitch gain (limited to 1 .0) 




prev_gp 


Word32 


previous good pitch gain 


ec_gain_codeState 


gbuf 


Word32[5] 


codebook gain history 




past_gain_code 


Word32 


previous codebook gain 




prev_gc 


Word32 


previous good codebook gain 


ph_dispState 


gainMem 


Word32[5] 


pitch gain history 




prevState 


Word32 


previously used impulse response 




prevCbGain 


Word32 


previous codebook gain 




lockFull 


Word 16 


force maximum phase dispersion 




onset 


Word 16 


onset counter 


Post_FilterState 


res2 


Word32[40] 


LP residual 




mem_syn_pst 


Word32[10] 


synthesis filter memory 




synth_buf 


Word16[170] 


synthesis filter work area 




agc_state 


agcState 


see below in this table 




preemph_state 


preemphasisState 


see below in this table 
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Struct name 


Variable 


Type[Length] 


Description 


agcState 


past gain 


Word 16 


past ago gain 


preemphasisState 


mem_pre 


Word 16 


filter state 


Post_ProcessState 


y2 hi 
y2 lo 
y1 hi 
y1 lo 
xO 
x1 


Word32 
Word32 
Word32 
Word32 
Word32 
Word32 


filter state, upper word 
filter state, lower word 
filter state, upper word 
filter state, lower word 
filter state 
filter state 



Homing procedure 



The principles of the homing procedures are described in 3GPP TS 06.090 [2]. This specification only includes a 
detailed description of the 8 decoder homing frames. For each AMR codec mode, the corresponding decoder homing 
frame has a fixed set of speech parameters shown in table 9a-9h. The bit allocation within these parameters is identical 
to the corresponding bit allocation of the source encoder output parameters given in 3GPP TS 06.090 [2]. 

In the following tables, the following naming convention is used for the individual parameters. Letters in italics indicate 
numbers. 

LPC_nindex of nth LSF submatrix 

LTP-LAG m adaptive codebook index for subframe m 

LTP-GAIN m adaptive codebook gain index in subframe m 

FCB-GAIN m fixed codebook gain index in subframe m 

GAIN_VQ m codebook gain VQ index in subframe m (subframe m and m+1 for MR475) 

POS m_n position index of nth pulse in subframe m 

POS m_n_k position index of nth and Mi pulse in subframe m 

POS m_n_k_lj position index of nth, A:th, /th, and jth pulse in subframe m 

SIGN m_n_k sign information for nth and ^h pulse in subframe m 

SIGN m_ ti_k_l Jsign information for nth, kth, Ith, andyth pulse in subframe m 

SlGN_m_n_k_POS_m_n sign information for nth and A:th pulse and position index for nth pulse in 
subframe m 
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Table 9a: Parameter values for the decoder homing frame (MR475) 



Parameter 


Value (LSB=bO) 


LPC1 


0x00 F8 


LPC2 


0x009D 


LPC3 


0x001 C 


LIP-LAG 1 


0x0066 


POS 1 1 2 


0x0000 


SIGN 1 1 2 


0x0003 


GAIN-VQ 1 


0x0028 


LIP-LAG 2 


0x000 F 


POS 2 1 2 


0x0038 


SIGN 2 1 2 


0x0001 


LIP-LAG 3 


0x000 F 


POS 3 1 2 


0x0031 


SIGN 3 1 2 


0x0002 


GAIN-VQ 3 


0x0008 


LIP-LAG 4 


0x000 F 


POS 4 1 2 


0x0026 


SIGN 4 1 2 


0x0003 



Table 9b: Parameter values for the decoder homing frame (MR515) 



Parameter 


Value (LSB=bO) 


LPG1 


0x00 F8 


LPC2 


0x009D 


LPG3 


0x001 


LIP-LAG 1 


0x0066 


POS 1 1 2 


0x0000 


SIGN 1 1 2 


0x0003 


GAIN-VQ 1 


0x0037 


LIP-LAG 2 


0x000 F 


POS 2 1 2 


0x0000 


SIGN 2 1 2 


0x0003 


GAIN-VQ 2 


0x0005 


LIP-LAG 3 


0x000 F 


POS 3 1 2 


0x0037 


SIGN 3 1 2 


0x0003 


GAIN-VQ 3 


0x0037 


LIP-LAG 4 


0x000 F 


POS 4 1 2 


0x0023 


SIGN 4 1 2 


0x0003 


GAIN-VQ 4 


0x001 F 
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Table 9c: Parameter values for the decoder homing frame (MR59) 



Parameter 


Value (LSB=bO) 


LPC 1 


0x00 F8 


LPC2 


0x00E3 


LPC 3 


0x002F 


LIP-LAG 1 


OxOOBD 


POS 1 1 2 


0x0000 


SIGN 1 1 2 


0x0003 


GAIN-VQ 1 


0x0037 


LIP-LAG 2 


0x000 F 


POS 2 1 2 


0x0001 


SIGN 2 1 2 


0x0003 


GAIN-VQ 2 


0x000 F 


LIP-LAG 3 


0x0060 


POS 3 1 2 


0x00F9 


SIGN 3 1 2 


0x0003 


GAIN-VQ 3 


0x0037 


LIP-LAG 4 


0x000 F 


POS 4 1 2 


0x0000 


SIGN 4 1 2 


0x0003 


GAIN-VQ 4 


0x0037 



Table 9d: Parameter values for the decoder homing frame (MR67) 



Parameter 


Value (LSB=bO) 


LPG1 


0x00 F8 


LPC 2 


0x00E3 


LPC 3 


0x002F 


LIP-LAG 1 


OxOOBD 


POS 112 3 


0x0002 


SIGN 112 3 


0x0007 


GAIN-VQ 1 


0x0000 


LIP-LAG 2 


OxOOOF 


POS 2 1 2 3 


0x0098 


SIGN 2 12 3 


0x0007 


GAIN-VQ 2 


0x0061 


LIP-LAG 3 


0x0060 


POS 3 1 2 3 


0x05C5 


SIGN 3 12 3 


0x0007 


GAIN-VQ 3 


0x0000 


LIP-LAG 4 


OxOOOF 


POS 4 1 2 3 


0x0318 


SIGN 4 12 3 


0x0007 


GAIN-VQ 4 


0x0000 
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Table 9e: Parameter values for the decoder homing frame (MR74) 



Parameter 


Value (LSB=bO) 


LPC 1 


0x00 F8 


LPC2 


0x00E3 


LPC 3 


0x002F 


LIP-LAG 1 


OxOOBD 


POS 112 3 4 


0x0006 


SIGN 112 3 4 


OxOOOF 


GAIN-VQ 1 


0x0000 


LTP-LAG 2 


0x001 B 


POS 2 12 3 4 


0x0208 


SIGN 2 12 3 4 


OxOOOF 


GAIN-VQ 2 


0x0062 


LTP-LAG 3 


0x0060 


POS 3 12 3 4 


0x1 BA6 


SIGN 3 12 3 4 


OxOOOF 


GAIN-VQ 3 


0x0000 


LTP-LAG 4 


0x001 B 


POS 4 12 3 4 


0x0006 


SIGN 4 12 3 4 


OxOOOF 


GAIN-VQ 4 


0x0000 



Table 9f : Parameter values for the decoder homing frame (MR795) 



Parameter 


Value (LSB=bO) 


LPG1 


0x00C2 


LPC 2 


0x00E3 


LPC 3 


0x002F 


LTP-LAG 1 


OxOOBD 


POS 112 3 4 


0x0006 


SIGN 112 3 4 


OxOOOF 


LTP-GAIN 1 


OxOOOA 


FCB-GAIN 1 


0x0000 


LTP-LAG 2 


0x0039 


POS 2 12 3 4 


0x1 COB 


SIGN 2 12 3 4 


0x0007 


LTP-GAIN 2 


OxOOOA 


FCB-GAIN 2 


OxOOOB 


LTP-LAG 3 


0x0063 


POS 3 12 3 4 


0x1 1A6 


SIGN 3 12 3 4 


OxOOOF 


LTP-GAIN 3 


0x0001 


FCB-GAIN 3 


0x0000 


LTP-LAG 4 


0x0039 


POS 4 12 3 4 


0x09A0 


SIGN 4 12 3 4 


OxOOOF 


LTP-GAIN 4 


0x0002 


FCB-GAIN 4 


0x0001 
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Table 9g: Parameter values for the decoder homing frame (MR102) 



Parameter 


Value (LSB=bO) 


LPC1 


0x00 F8 


LPC2 


0x00E3 


LPC3 


0x002F 


LIP-LAG 1 


0x0045 


SIGN 1 1 5 


0x0000 


SIGN 1 2 6 


0x0000 


SIGN 1 3 7 


0x0000 


SIGN 1 4 8 


0x0000 


POS 112 5 


0x0000 


POS 13 6 7 


0x0000 


POS 1 4 8 


0x0000 


GAIN-VQ 1 


0x0000 


LIP-LAG 2 


0x001 B 


SIGN 2 1 5 


0x0000 


SIGN 2 2 6 


0x0001 


SIGN 2 3 7 


0x0000 


SIGN 2 4 8 


0x0001 


POS 2 12 5 


0x0326 


POS 2 3 6 7 


OxOOCE 


POS 2 4 8 


0x007E 


GAIN-VO 2 


0x0051 


LIP-LAG 3 


0x0062 


SIGN 3 1 5 


0x0000 


SIGN 3 2 6 


0x0000 


SIGN 3 3 7 


0x0000 


SIGN 3 4 8 


0x0000 


POS 3 1 2 5 


0x01 5A 


POS 3 3 6 7 


0x0359 


POS 3 4 8 


0x0076 


GAIN-VO 3 


0x0000 


LIP-LAG 4 


0x001 B 


SIGN 4 1 5 


0x0000 


SIGN 4 2 6 


0x0000 


SIGN 4 3 7 


0x0000 


SIGN 4 4 8 


0x0000 


POS 4 12 5 


0x01 7C 


POS 4 3 6 7 


0x0215 


POS 4 4 8 


0x0038 


GAIN-VO 4 


0x0030 
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Table 9h: Parameter values for the decoder homing frame (MR122) 



Parameter 


Value (LSB=bO) 


LPC1 


0x0004 


LPC2 


0x002A 


LPC3 


OxOODB 


LPC4 


0x0096 


LPC5 


0x002A 


LIP-LAG 1 


0x0156 


LIP-GAIN 1 


OxOOOB 


SIGN 1 1 6 POS 1 1 


0x0000 


SIGN 1 2 7 POS 1 2 


0x0000 


SIGN 1 3 8 POS 1 3 


0x0000 


SIGN 1 4 9 POS 1 4 


0x0000 


SIGN 1 5 10 POS 1 5 


0x0000 


P0S1 6 


0x0000 


P0S1 7 


0x0000 


P0S1 8 


0x0000 


P0S1 9 


0x0000 


POS 1 1 


0x0000 


FCB-GAIN 1 


0x0000 


LIP-LAG 2 


0x0036 


LIP-GAIN 2 


OxOOOB 


SIGN 2 1 6 POS 2 1 


0x0000 


SIGN 2 2 7 POS 2 2 


OxOOOF 


SIGN 2 3 8 POS 2 3 


OxOOOE 


SIGN 2 4 9 POS 2 4 


OxOOOC 


SIGN 2 5 10 POS 2 5 


OxOOOD 


POS 2 6 


0x0000 


POS 2 7 


0x0001 


POS 2 8 


0x0005 


POS 2 9 


0x0007 


POS 2 10 


0x0001 


FCB-GAIN 2 


0x0008 


LIP-LAG 3 


0x0024 


LIP-GAIN 3 


0x0000 


SIGN 3 1 6 POS 3 1 


0x0001 


SIGN 3 2 7 POS 3 2 


0x0000 


SIGN 3 3 8 POS 3 3 


0x0005 


SIGN 3 4 9 POS 3 4 


0x0006 


SIGN 3 5 10 POS 3 5 


0x0001 


POS 3 6 


0x0002 


POS 3 7 


0x0004 


POS 3 8 


0x0007 


POS 3 9 


0x0004 


POS 3 10 


0x0002 


FCB-GAIN 3 


0x0003 


LIP-LAG 4 


0x0036 


LIP-GAIN 4 


OxOOOB 


SIGN 4 1 6 POS 4 1 


0x0000 


SIGN 4 2 7 POS 4 2 


0x0002 


SIGN 4 3 8 POS 4 3 


0x0004 


SIGN 4 4 9 POS 4 4 


0x0000 


SIGN 4 5 10 POS 4 5 


0x0003 


POS 4 6 


0x0006 


POS 4 7 


0x0001 


POS 4 8 


0x0007 


POS 4 9 


0x0006 


POS 4 10 


0x0005 


FCB-GAIN 4 


0x0000 
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6 File formats 

This section describes the file formats used by the encoder and decoder programs. The test sequences defined in [2] also 
use the file formats described here. 

6.1 Speech file (encoder input / decoder output) 

Speech files read by the encoder and written by the decoder consist of 16-bit words where each word contains a 13-bit, 
left aligned speech sample. The byte order depends on the host architecture (e.g. MSByte first on SUN workstations, 
LSByte first on PCs etc.). Both the encoder and the decoder program process complete frames (of 160 samples) only. 

This means that the encoder will only process n frames if the length of the input file is n'*160 + k words, while the 
files produced by the decoder will always have a length of n'*160 words. 

6.2 Mode control file (encoder input) 

The encoder program can optionally read in a mode control file which specifies the encoding mode for each frame of 
speech processed. The file is a text file containing one line per speech frame. Each line contains one of the mode names 
from the hst {MR475, MR515, MR59, MR67, MR74, MR795, MR102, MR122}. 

6.3 Parameter bitstream file (encoder output / decoder input) 

The files produced by the speech encoder/expected by the speech decoder contain an arbitrary number of frames in the 
format described in RFC 3267 [9], sections 5.1 and 5.3. 

By using preprocessor definition encoder/decoder can optionally use AMR Interface Format 2. The format is described 
in TS 26.101 [8] Annex A. 

By using another preprocessor definition encoder/decoder can optionally use format compatible with the existing AMR 
fixed-point C-code. Frame format is following. 



FRAME TYPE B1 B2 ... B244 MODE INFO unusedl ... unused4 



Each box corresponds to one Wordl 6 value in the bitstream file, for a total of 250 words or 500 bytes per frame. The 
fields have the following meaning: 

FRAME_TYPE transmit frame type, which is one of 
TX_SPEECH (0x0000) 
TX_SID_FIRST (0x0001) 
TX_SID_UPDATE (0x0002) 
TX_NO_DATA (0x0003) 

B0...B244 speech encoder parameter bits (i.e. the bitstream itself)- Each Bx either has the value 0x0000 

or 0x0001. Only mode MR122 really uses all 244 bits; for the other modes, only the first n 
bits are used (35 < n < 204). The remaining bits are unused (written as 0x0000) 

MODEJNFO encoding mode information, which is one of 
MR475 (0x0000) 
MR515 (0x0001) 
MR59 (0x0002) 
MR67 (0x0003) 
MR74 (0x0004) 
MR795 (0x0005) 
MR102 (0x0006) 
MR122 (0x0007) 

unusedl ... 4 unused, written as 0x0000 

As indicated in section 6.1 above, the byte order depends on the host architecture. 
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Annex A (informative): 
Change History 



TSG 

SA# 


Tdoc 


CR 


Rev 


Cat 


PH 


Vers 


New 
Vers 


Subject 


10 


SP-000577 002 

1 1 


A Rel-4 3.0.0 4.0.0 AMR Core Frame bit ordering (AMR speech Codec; Floating 
.point C-Code 


12 


SP-010306 


004 




A 


Rel-4 4.0.0 


4.1.0 


Limiting predicted codebook gain computing in encoder 


12 


SP-010306 


006 




A 


Rel-4 4.0.0 


4.1.0 


Correction of decoder operation in error concealment of lost 
frames 


12 


SP-010306 


008 




A 


Rel-4 4.0.0 


4.1.0 


Correction of mode state bug in AMR decoder 


\T2 


SP-010306 012 




A 


Rel-4 4.0.0 


4.1.0 


Correction of decoder Reset 


12 


SP-010306 


014 




A 


Rel-4 


4.0.0 


4.1.0 


Correction of comfort noise parameter interpolation bug of AMR 
decoder 


12 


SP-010306 


016 




A 


Rel-4 


4.0.0 


4.1.0 


Correction of the TX TYPE and RX TYPE identifiers 




MCC 








Rel-4 


4.1.0 


4.1.1 


Correction of bugs in code 


13 'SP-010452 010 




A 


Rel-4 


4.1 .1 4.2.0 Correction to make encoder and decoder memories 
independent 


13 


SP-010452 


018 




A 


Rel-4 


4.1.1 


4.2.0 


Correction of decoder operation in error concealment of lost 
frames 


15 


SP-020079 


019 




A 


Rel-4 


4.2.0 


4.3.0 


Maintaining bit-exactness with TS 26.073 


16 












'5.0.0 Version for Release 5 


19 


SP-030088 


21 


1 


F 


Rel-5 


5.0.0 5.1.0 MMS compatible i/o format option 


19 


SP-030088 


24 




A 


Rel-5 5.0.0 J 


5.1.0 


Correction to floating-point implementation of sp dec.c 


20 


SP-030214 |26 




A 


Rel-5 1 5.1.0 


5.2.0 


Correction on codec mode handling during DTX 
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